Prediction of protein-protein interaction sites using support vector machines

نویسندگان

  • Asako Koike
  • Toshihisa Takagi
  • Gideon Schreiber
چکیده

The identification of protein-protein interaction sites is essential for the mutant design and prediction of protein-protein networks. The interaction sites of residue units were predicted using support vector machines (SVM) and the profiles of sequentially/spatially neighboring residues, plus additional information. When only sequence information was used, prediction performance was highest using the feature vectors, sequentially neighboring profiles and predicted interaction site ratios, which were calculated by SVM regression using amino acid compositions. When structural information was also used, prediction performance was highest using the feature vectors, spatially neighboring residue profiles, accessible surface areas, and the with/without protein interaction sites ratios predicted by SVM regression and amino acid compositions. In the latter case, the precision at recall = 50% was 54-56% for a homo-hetero mixed test set and more than 20% higher than for random prediction. About 30% of the residues wrongly predicted as interaction sites were the closest sequentially/spatially neighboring on the interaction site residues. The predicted residues covered 86-87% of the actual interfaces (96-97% of interfaces with over 20 residues). This prediction performance appeared to be slightly higher than previously reported study. Comparing prediction accuracy of each molecule, it seems to be easier to predict interaction sites for stable complexes. 3 INTRODUCTION Proteins perform a biological function by interacting with other proteins, compounds, RNA,

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Protein-Protein Interaction Sites Using Support Vector Machines

The identification of protein-protein interaction sites is essential for the mutant design and prediction of protein-protein networks. The interaction sites of residue units were predicted using support vector machines (SVM) and the profiles of sequentially/spatially neighboring residues, plus additional information. When only sequence information was used, prediction performance was highest us...

متن کامل

A Comparative Study of Extreme Learning Machines and Support Vector Machines in Prediction of Sediment Transport in Open Channels

The limiting velocity in open channels to prevent long-term sedimentation is predicted in this paper using a powerful soft computing technique known as Extreme Learning Machines (ELM). The ELM is a single Layer Feed-forward Neural Network (SLFNN) with a high level of training speed. The dimensionless parameter of limiting velocity which is known as the densimetric Froude number (Fr) is predicte...

متن کامل

The identi®cation of protein±protein interaction sites is essential for the mutant design and prediction of protein± protein networks. The interaction sites of residue units

The identi®cation of protein±protein interaction sites is essential for the mutant design and prediction of protein± protein networks. The interaction sites of residue units were predicted using support vector machines (SVM) and the pro®les of sequentially/spatially neighboring residues, plus additional information. When only sequence information was used, prediction performance was highest usi...

متن کامل

Prediction of Protein-Protein Interaction Sites with Two-Stage Support Vector Machine

Protein-protein interactions play an important role in a number of biological processes such as DNA replication and repair, transcription, metabolism, and signal transduction cascade. To deeply understand protein-protein interactions, engineer proteins, and design drugs, we need to analyze detailed interaction mechanisms at the atomic level. Many protein complex structures have previously been ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004